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1 .  OVERVIEW 

This  annual  report  describes  the  work  performed  during  the 
first  year  of  a  two-year  effort.  The  overall  goal  of  the  project 
is  to  develop  improved  techniques  for  acoustic-phonetic 
recognition,  with  eventual  application  to  automatic  speech 
understanding . 

Earlier  work  on  this  subject  has  vielded  phonetic 
recognition  rates  of  60-70%.  We  estimate  that  a  minimum 
recognition  rate  of  80%  is  needed  for  a  hiqh-performance  speech 
understanding  system.  We  believe  that  the  information  needed  to 
achieve  the  higher  recognition  accuracy  is  available  in  the 
speech  signal.  This  research  is  directed  towards  discovering  as 
many  context-sensitive  acoustic-phonetic  rules  as  possible  to 
improve  the  recognition  performance. 

This  first  year  of  research  was  devoted  to  three  major 
activities:  Speech  database  generation,  computer  tool  building, 

and  development  of  acoustic-phonetic  rules.  Specifically,  the 
following  tasks  were  performed: 

1.  Designed,  recorded,  digitized  and  processed  a  database 
of  speech. 

2.  Converted  and  extended  several  large  utility  programs 
that  had  run  on  the  PDP10  to  run  on  the  VAX  11/780. 

3.  Implemented  spectrogram  display  programs  for  the  Benson 
Varian  printer  and  the  BBN  BitGraph  display  terminal. 

4.  Examined  a  selected  number  of  spectrograms  and 
catalogued  acoustic-phonetic  rules. 

5.  Initiated  the  development  of  a  set  of  context-dependent 
acoustic-phonetic  features  to  distinguish  among 
phonemes  with  greater  accuracy. 
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In  the  following  sections,  we  describe  each  of  these  tasks 
in  greater  detail. 

To  aid  in  our  acoustic-phonetic  work,  a  real-time 
spectrograph  display  was  purchased  from  Spectraphonics.  The 
real-time  spectrograph  has  been  useful  in  looking  at  spectrograms 
and  discovering  certain  acoustic-phonetic  properties. 
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2 .  DATABASE 

To  enable  us  to  discover  rules  for  phonetic  recognition  that 
are  applicable  to  a  wide  range  of  phonetic  environments,  we 
designed  a  large  database  of  sentences.  The  initial  database 
consists  of  110  different  sentences.  Phonetic  transcriptions  of 
the  sentences  were  analyzed  statistically  to  ensure  that  they 
represented  aphonetic  balance  similar  to  that  in  fluent  everyday 
speech.  Appendix  1  contains  a  list  of  the  110  sentences.  It 
also  indicates  the  word  count  and  phoneme  counts  used  in  the 
phonetic  analysis.  Each  sentence  was  recorded  by  four  male 
speakers.  The  digitized  speech  waveforms  were  then  analyzed 
using  our  Primary  Speech  Analysis  (PSA)  program,  which  had  been 
converted  to  run  on  the  VAX  computer.  The  output  of  the  PSA 
program  is  a  large  set  of  acoustic  parameters  such  as  the  energy 
in  different  frequency  bands,  zero-crossing  count,  formant 
frequencies,  etc.,  for  each  10  ms  frame.  These  parameters  will 
be  used  by  the  Acoustic-Phonetic  Recognition  (APR)  program  that 
will  be  developed  under  this  project. 

We  have  just  begun  to  align  the  phonetic  transcription  of 
each  of  the  sentences  with  the  parameters.  This  will  allow  us  to 
use  the  Acoustic-Phonetic  Experiment  Facility  ( APEF)  to  discover 
more  rules  and  features  by  looking  at  all  occurrences  of  any 
phonetic  sequence  within  the  database. 

To  augment  our  new  database,  we  have  also  transferred  the 
phonetically  transcribed  database  that  had  been  used  in  the  ARPA 
Speech  Understanding  Research  (SUR)  Project  to  the  VAX. 
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3.  UTILITY  PROGRAMS  CONVERTED 

Our  move  from  the  PDP-10  computer  to  the  VAX-11/780  has 
necessitated  the  conversion  of  all  of  our  utility  programs  that 
had  been  developed  over  the  past  several  years.  The  move  to  the 
VAX  computer  was  necessary  as  the  VAX  system  provides  both  real¬ 
time  processing  capabilities  (with  the  aid  of  the  FPS-AP120B  and 
the  DEC  LPA-11K  systems)  and  interactive  computing  on  a  computer 
with  a  large  virtual  address  space.  Both  of  these  capabilities 
are  necessary  for  the  efficient  performance  of  our  research. 

Below  we  describe  the  programs  that  have  been  transferred 
(and  rewritten  when  necessary) . 

3.1  Library 

We  have,  over  the  years,  developed  an  extensive  subroutine 
library  with  routines  that  perform  a  wide  variety  of  signal 
processing  functions,  string  manipulation  functions,  user 
interaction  functions,  etc.  As  a  first  step  to  developing 
programs  on  the  VAX  computer,  we  transferred  or  rewrote  all  of 
the  generally  useful  subroutines  in  our  user  library.  Appendix  2 
contains  a  list  of  the  routines  in  the  library,  with  a  short 
description  for  each. 

To  make  the  library  more  useful,  we  used  the  on-line 
documentation  facility  provided  by  the  VAX  VMS  operating  system, 
together  with  a  tree  structured  documentation  file. 
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3.2  RTIME 

This  program,  which  had  resided  on  our  PDP-11/40  system  and 
was  converted  to  run  on  the  VAX,  is  used  for  real-time  signal 
acquisition  and  playback,  waveform  editing,  and  spectral  display. 
Although  the  original  PDP-11  program  and  the  new  VAX  program  are 
both  in  FORTRAN,  the  program  had  to  be  almost  completely 
rewritten  due  to  the  differences  in  I/O,  real-time  systems,  FPS 
software  package,  and  operating  system  facilities.  The  program 
was  also  modified  to  enable  recording  of  large  databases  with 
improved  user-machine  interaction  and  automatic  editing  of 
silences . 

One  important  type  of  speech  database  is  characterized  by 
large  numbers  of  individual  sentences.  Extracting  these  from  a 
previously  recorded  audio  tape  is  a  time  consuming  and  laborious 
task.  Consequently,  we  implemented  a  semi-automatic  sentence 
acquisition  capability  into  the  RTIME  program.  In  the  most 
commonly  used  mode  of  operation,  the  program  is  given  a  list  of 
file  names  for  the  output.  The  program  then  prompts  the  user 
with  the  name  of  the  file  and  waits  for  a  carriage  return.  The 
speaker  types  the  carriage  return,  and  whenever  ready,  speaks  the 
sentence.  The  program  automatically  detects  the  beginning  and 
end  of  speech,  plays  back  the  edited  waveform,  and  asks  for 
verification.  If  verified,  it  writes  out  the  file  and  goes  on  to 
the  next  sentence.  Otherwise,  it  allows  many  other  operations 
such  as  playing  the  file  again,  recording  it  again,  or  allowing 
the  user  to  override  the  end  points  chosen  by  the  program. 
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3.3  PSA 

The  Primary  Speech  Analysis  (PSA)  program  performs  a  wide 
variety  of  functions.  First,  it  accepts  as  input  a  digitized 
speech  waveform,  and  analyzes  it  to  produce  a  large  set  (50)  of 
acoustic  parameters.  The  parameters  computed  include  several 
energy  parameters  derived  from  the  LPC  spectrum,  the  formant 
frequencies  (also  derived  by  algorithm  from  the  narrow-bandwidth 
LPC  poles) ,  and  other  useful  parameters  such  as  the  z  o-crossing 
count  and  spectral  center  of  mass.  The  speech  pe  meters  and 
manual  transcriptions  produced  by  this  program  are  tb  used  by 
the  APEF  and  phonetic  recognition  programs  (described  'r  ow)  . 

The  program  is  capable  of  displaying  the  most  commonly  used 
parameters  using  the  I ML AC  display  computer.  It  also  displays  a 
section  of  the  speech  waveform  and  the  LPC  power  spectrum  at  a 
time  indicated  by  a  cursor.  By  allowing  the  simultaneous  display 
of  all  the  available  information,  the  program  allows  a  user  to 
develop  intuitions  for  distinguishing  among  phonemes.  The 
program  is  also  used  for  the  interactive  manual  transcription  of 
speech  material. 

Since  PSA  consists  largely  of  I/O  functions  and  interactions 
with  the  I ML AC  display  computer,  the  program  had  to  be  completely 
rewritten.  The  program  had  evolved  over  a  period  of  10  years, 
and  this  was  a  good  opportunity  to  restructure  the  many  functions 
in  a  well-organized  manner.  The  new  program  was  written  in 
FORTRAN-77,  which  is  a  block  structured  language,  making  this 
redesign  straightforward. 
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3 . 4  APEF 

The  Acoustic-Phonetic  Experiment  Facility  (APEF)  is  a  highly 
interactive  system  that  allows  a  speech  researcher  to  perform  a 
wide  variety  of  experiments  on  a  large  database  of  continuous 
speech  [1] •  This  system,  which  has  been  used  extensively  during 
our  previous  phonetic  research,  is  essential  to  the  development 
of  a  large  set  of  acoustic-phonetic  features  and  rules  for 
recognizing  the  different  phonemes  of  English. 

APEF  was  written  in  BCPL  on  the  PDP-10,  so  it  had  to  be 
translated  to  a  language  that  exists  on  the  VAX.  Several 
languages  were  considered  for  this  translation,  including  PASCAL, 
PRAXIS,  a  subset  of  ADA,  FORTRAN -7 7 ,  and  C.  C  was  chosen  since  it 
is,  by  far,  the  language  that  is  most  similar  to  BCPL.  C  has 
recently  become  available  on  VAX  VMS  syrtems,  and  is  compatible 
with  all  other  VMS  languages.  The  translation  of  12,000  lines  of 
BCPL  code  into  15,000  lines  of  C  code  has  been  completed.  At 
present,  some  of  this  very  large  program  has  been  debugged  and  is 
operating  correctly.  However,  there  are  still  several  essential 
modules  that  have  not  been  debugged  as  yet.  This  debugging 
process  should  be  completed  during  the  coming  year. 

Since  this  program  will  be  used  so  extensively  in  the 
research  under  this  project,  we  present  below  a  brief  description 
of  its  capabilities.  The  primary  use  of  the  program  is  to 
examine  many  occurrences  of  some  set  of  phonetic  phenomena 
extracted  from  a  large  database  of  continuous  speech,  and  then  to 
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examine  this  data  statistically ,  and  develop  algorithms  for 
phonetic  recognition  without  the  need  to  incorporate  these 
algorithms  in  a  complete  phonetic  recognition  program.  The  user 
first  specifies  a  phonetic  context  of  interest.  For  example,  he 
might  be  interested  in  distinguishing  among  the  unvoiced  plosives 
when  followed  by  vowels  or  glides.  The  phonetic  context  is 
specified  in  terms  of  a  sequence  of  segment  descriptions.  These 
descriptions  can  be  given  in  terms  of  phonemes,  classes  of 
phonemes,  or  boolean  combinations  of  classes.  The  program  also 
allows  the  user  to  specify  the  context  in  terms  of  the  presence 
or  absence  of  word  boundaries,  syllable  boundaries,  different 
vowel  stress  levels,  or  particular  words.  It  also  allows  for  the 
designation  of  some  of  the  segments  being  optional. 

Once  the  phonetic  context  has  been  specified,  the  user 
enters  a  procedure  that  will  compute  several  acoustic-phonetic 
features.  The  procedure  is  given  in  terms  of  a  list  of  high- 
level  routines.  The  routines  include  a  wide  variety  (52)  of 
functions  for  examining  acoustic  parameters  (e.g.,  average, 
maximum,  minimum,  standard  deviation,  etc.),  arithmetic  functions 
(e.g.,  sum,  difference,  product,  quotient,  abs,  max,  min,  etc.), 
boolean  functions  (e.g.,  and,  not,  or),  loops,  branches,  tests, 
display  functions,  and  computation  of  new  acoustic  parameters 
from  the  raw  power  spectrum. 

The  user  then  instructs  the  program  to  search  the  database 
for  occurrences  of  the  specified  context.  For  each  occurrence 
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found,  the  procedure  is  run  to  compute  the  desired  features. 
Then,  the  user  is  given  a  variety  of  ways  of  examining  the  data 
gathered.  The  program  is  capable  of  producing  various 
statistical  displays  on  the  IMLAC  display  processor,  giving 
statistical  information  for  a  specified  feature,  and  performing 
pattern  recognition  experiments  on  the  gathered  data  to  attempt 
to  distinguish  among  the  different  phonemes  using  the  features 
gathered. 

The  end  result  is  that  an  initial  experiment  can  be 
performed  in  just  a  few  minutes,  and  easily  modified,  making  it 
possible  to  fully  investigate  a  particular  phonetic  recognition 
problem  in  a  few  hours. 

3.5  IMLAC  Display  Interface 

The  IMLAC  display  interfaces  that  had  been  developed  on  the 
PDP10  were  reworked  so  that  the  VAX  could  communicate  with  the 
IMLAC  in  the  same  ways  as  the  PDP10.  This  conversion  included 
the  special  purpose  IMLAC  programs  used  by  RTIME  (SPECIZ) ,  which 
supports  the  scrolled  waveform  display  and  two  spectrum  displays. 
The  PSA  program  also  uses  a  special  purpose  IMLAC  display  program 
(PSAI) ,  which  allows  the  simultaneous  display  of  14  speech 
parameters,  a  power  spectrum,  and  a  section  of  a  speech  waveform. 
It  also  includes  several  2-way  interactions  between  the  host 
computer  and  the  IMLAC  computer,  controlled  by  various  knobs  and 
switches  connected  to  the  IMLAC.  Finally,  we  brought  up  the 
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general  purpose  display  package/  IMSYS,  that  is  used  by  APEF  and 
other  programs. 
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4 .  DISPLAY  PROGRAMS 

We  have  developed  several  display  programs  for  the  VAX 
computer.  These  display  programs  fall  into  two  categories: 

1.  Spectrogram  Displays 

2.  Speech  Parameter  Displays 

We  have  developed  a  spectrogram  display  program  that 
produces  a  hardcopy  display  of  a  spectrogram  on  the  Benson  Varian 
printer,  which  is  connected  to  the  VAX  computer.  This  program 
uses  the  FPS  for  the  signal  processing  and  image  processing 
necessary  to  produce  a  pleasing  visual  display.  We  have  also 
developed  a  version  of  the  program  that  produces  a  spectrogram  on 
the  BBN  BitGraph  terminal.  These  interactive  programs  are 
intended  to  complement  the  capabilities  of  the  Spectraphonics 
Real-Time  Spectrograph.  In  particular,  the  new  programs  also 
allow  the  display  of  several  acoustic  parameters  adjacent  to  the 
spectrogram,  on  the  same  time  scale,  using  the  PARPLOT  program 
discussed  below. 

The  PARPLOT  program  was  developed  to  make  hardcopy  displays 
of  the  speech  parameters  (computed  by  PSA)  on  the  Benson  Varian 
printer.  These  parameter  displays  are  scaled  such  that  they  are 
compatible  with  the  spectrogram  displays.  The  program  allows  the 
user  to  specify,  in  an  interactive  way,  which  parameters  are  to 
be  plotted,  and  with  what  scales.  These  hardcopy  plots  will  be 
used  to  help  design  the  acoustic-phonetic  recognition  algorithms. 
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5.  ACOUSTIC-PHONETIC  RULES/ FEATURES 

One  of  the  primary  research  tasks  performed  this  year  was  to 
develop  context-dependent  acoustic-phonetic  rules  for 
distinguishing  among  phonemes  with  greater  accuracy  than 
previously  possible.  For  each  class  of  phonemes,  we  began  by 
examining  spectrograms  of  those  phonemes  in  different  contexts. 
The  Spectraphonics  was  particularly  useful  for  this  process, 
since  it  allowed  us  to  generate  spectrograms  for  a  phoneme  in 
many  contexts  with  a  minimum  of  effort.  An  important  source  of 
ideas  for  useful  features  was  obtained  from  our  interactions  with 
the  research  group  at  MIT.  When  an  idea  was  mentioned,  we  were 
able  to  verify  it  quickly,  using  the  spectraphonics.  Most  often, 
the  rules  developed  from  looking  at  spectrograms  are  qualitative 
in  nature,  and  are  also  based  on  a  very  small  number  of  samples. 
Therefore,  we  then  used  APEF  to  develop  specific  quantitative 
rules  and  tested  them  on  a  large  database. 

While  the  VAX  version  of  APEF  is  being  translated  and 
debugged,  we  have  been  using  the  PDP-10  version  of  the  program 
with  our  old  database  of  labeled  speech  to  continue  to  find  rules 
and  features  that  are  useful  for  acoustic-phonetic  recognition. 
The  rules  that  we  are  developing  are  of  the  type  that  would  be 
appropriate  for  addition  to  the  Acoustic-Phonetic  Recognition 
(APR)  program  that  is  also  being  written.  Most  of  the  effort 
with  APEF,  however,  has  been  directed  towards  finding  features 
that  are  roost  useful  for  distinguishing  among  phonemes  that  are 
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within  certain  classes.  We  have  found  that  it  is  possible  to 
distinguish  among  particular  phonemes  within  a  class  (for  example 
among  the  three  nasal  phonemes,  or  the  three  unvoiced  plosive 
phonemes)  with  80%-90%  accuracy.  This  accuracy  is  achieved  by 
searching  for  features  that  are  fairly  independent  of  the 
phonetic  context  (adjacent  phonemes) ,  and  also  by  considering 
separate  phonetic  contexts  where  appropriate. 

Typically,  the  features  are  made  independent  of  the  phonetic 
context  by  normalizing  them  by  some  measurement  made  during  the 
adjacent  phoneme.  For  example,  one  strong  cue  to  the  identity  of 
a  nasal  is  the  direction  of  the  formant  transitions  in  the 
preceding  vowel.  The  second  and  third  formants  tend  to  drop 
preceding  an  [M] ,  rise  preceding  an  [N] ,  and  come  together 
preceding  and  [NX].  However,  the  actual  amount  and  direction  of 
the  transition  depends  on  the  previous  vowel.  Figure  1  shows 
density  distributions  of  the  derivative  of  F3  in  the  two  frames 
preceding  an  [M]  or  an  [N] .  (The  figures  shown  were  produced  by 
the  APEF  program  and  transferred  directly  from  the  IMLAC  screen 
to  the  Benson-Varian  printer  plotter.)  As  can  be  seen,  the 
overlap  in  the  distributions  for  the  two  phonemes  is  quite  large, 
making  this  (commonly  quoted)  feature  quite  useless.  However,  by 
normalizing  the  amount  of  formant  change  by  the  measured  formant 
frequencies  in  the  middle  of  the  vowel,  the  transitions  are  made 
much  more  useful  in  distinguishing  among  the  nasals,  without 
having  to  know  exactly  what  the  preceding  vowel  was.  Figure  2 
shows  density  distributions  of  the  adjusted  derivative  values. 
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CONTEXT-DEPENDENT  FEATURES 
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A  ct  j  us  ted  Del'  i  vat  i  ve  of  F3  foh  M  / 
Fig.  2.  Derivative  is  Adjusted  by  Vowel  Formant 
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As  can  be  seen,  the  overlap  between  the  distributions  has  been 
substantially  decreased.  A  similar  effect  is  shown  for  the 
derivative  of  F2  in  Figures  3  and  4.  While  each  of  these 
features  is  not  sufficient  to  separate  the  nasals  with  high 
accuracy,  taken  together  with  other  useful  features,  the  phoneme 
recognition  can  be  quite  high.  Figure  5  shows  the  results  of  an 
APEF  experiment  using  four  features  to  distinguish  among  the 
three  nasals.  The  cases  considered  included  all  the  nasals 
preceded  by  any  vowels.  There  were  a  total  of  339  such  nasals  in 
the  database.  In  addition  to  the  two  adjusted  formant 
derivatives,  the  average  F2  and  F3  during  the  nasal  were  used  as 
features.  The  resulting  performance  was  87%  correct  on  the  first 
choice,  and  98%  on  the  second  choice.  The  confusion  matrix  for 
the  first  choice  is  shown  at  the  bottom  of  the  figure. 

Figure  6  shows  the  improvement  in  performance  when  using 
context-dependent  features,  and  when  considering  the  phoneme 
context  in  the  decision.  The  first  set  of  results  shows  that  the 
accuracy  when  using  the  unnormalized  derivatives  and  averages  is 
84%.  Using  the  adjusted  derivatives  increases  the  performance  to 
87%.  Finally,  we  considered  adding  another  feature  -  the  F2-F3 
distance  in  the  frame  before  the  nasal.  This  distance  should  be 
small  for  [NX]  and  larger  for  the  other  nasals.  As  shown,  the 
performance  actually  decreases  to  86%,  due  to  the  fact  that  this 
formant  distance  can  also  be  affected  considerably  by  the 
preceding  vowel.  Consequently,  we  considered  the  cases  of  nasals 
preceded  by  those  vowels  with  close  F2-F3  distance 
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CONTEXT-DEPENDENT  FEATURES 
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B 

USE  OF  CONTEXT-DEPENDENT  FEATURES 


B 

CONTEXT:  (VOWEL)  (NASAL) 
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FEATURES: 

1)  Adjusted  F2  Derivative 

2)  Adjusted  F3  Derivative 

3)  Average  F2  during  Nasal 

4)  Average  F3  during  Nasal 


RESULT:  87%  correct;  98%  on  second  choice 
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IMPROVEMENT  ACHIEVED  BY  USING 
PARAMETER  AND  PHONETIC  CONTEXT 
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( [ IY,EY, AY,OW,OY,AW,ER,AXR] )  separately  from  all  other  vowels. 
As  shown,  the  performance  in  these  two  cases  when  considered 
separately  increased  to  92%  and  90%  respectively.  This 
recognition  performance  among  the  nasals  is  quite  high,  compared 
to  results  reported  in  the  literature,  and  compared  to  results 
previously  obtained  by  us. 

We  have  shown  that  by  performing  the  recognition  under 
separate  hypotheses  for  the  context,  the  separation  of  phonemes 
within  a  class  can  be  improved  significantly.  Although  the  APR 
program  cannot  reliably  determine  the  phonetic  context,  the  word 
matching  component  in  the  speech  recognition  system  has  available 
the  complete  spelling  of  any  word  being  hypothesized  by  the  rest 
of  the  system.  It  can  then  use  only  the  scores  assigned  under 
the  correct  hypothesis  for  the  context.  This  same  logic  also 
applies  to  a  program  trying  to  determine  the  phonetic  identity  of 
a  sentence  (e.g.,  a  phonetic  vocoder),  since  when  evaluating  any 
phoneme  hypothesis,  the  program  can  take  into  account  the 
previous  phonemes  in  the  theory  under  consideration. 
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APPENDIX  1 

SENTENCE  LIST  AND  PHONETIC  ANALYSIS 
FOR  NEW  DATABASE 


Final  ARPA/ONR  sentences,  March  19,  1982 


1  Halve  the  height  of  the  display  window. 

2  Show  the  result  below  and  to  the  left  of  the  graph. 

3  I  think  someone's  made  a  mistake. 

4  Draw  an  equilateral  triangle  around  the  equation. 

5  How  far  is  the  throughway  from  the  railroad  terminal? 

6  Compute  a  spectrogram  and  display  it. 

7  Which  utterance  contains  the  fewest  sonorants? 

8  These  two  equations  are  inconsistent. 

9  Reduce  the  abscissa  to  a  twelfth  of  its  present  length. 

10  Exchange  the  positions  of  the  graph  and  the  text. 

11  Is  there  a  good  manuscript  on  this  topic  in  the  library? 

12  I  want  to  edit  the  latest  version  of  the  proposal. 

13  Before  showing  that  data,  check  its  security  classification. 

14  There  should  be  a  subroutine  to  do  that  automatically. 

15  Whose  decision  was  that? 

16  Find  the  average  of  the  smallest  values. 

17  Calculate  the  sample's  standard  deviation. 

18  What  happened  to  the  missing  data? 

19  What's  the  temperature  in  New  York  City? 

20  Is  there  a  good  flight  available  in  the  reverse  direction? 

21  Is  the  weather  forecast  for  tomorrow  better? 

22  Can  the  budget  be  stretched  to  cover  these  costs? 

23  Plot  both  functions  in  the  same  figure. 

24  Switch  the  order  of  these  two  paragraphs. 

25  Put  the  title  at  the  top  of  the  page. 

26  Capitalize  only  the  proper  names. 

27  There  should  be  a  comma  after  the  third  word  in  this  sentence. 

28  Remove  the  opening  quotes  from  the  next  three  phrases. 
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29  How  much  was  budgeted  for  travel  in  the  second  quarter? 

30  When  is  the  next  progress  report  due  in  Washington? 

31  Plot  one  function  immediately  above  the  other. 

32  Isn't  there  some  other  way  to  show  the  structure  in  the  data? 

33  Indent  alternate  lines  in  this  paragraph. 

34  Show  me  all  the  expenses  approved  so  far. 

35  How  many  purchase  orders  are  outstanding? 

36  Why  has  his  trip  report  not  been  submitted? 

37  Are  there  any  other  late  trip  reports? 

38  Can  we  close  the  books  on  the  project's  first  phase  now? 

39  The  RFP  contained  a  detailed  work  statement. 

40  Who  do  we  have  free  to  work  on  this  .task? 

41  Reschedule  the  remainder  of  the  tasks  as  well. 

42  Who's  project  manager  for  this  effort? 

43  Did  the  accounting  system  screw  up  again? 

44  Has  a  critical  path  analysis  been  done  for  this  job? 

45  Which  jobs  are  furthest  behind  schedule? 

46  Arrange,  a  meeting  of  all  task  managers  for  Tuesday. 

47  Have  the  replacement  parts  be'en  delivered  yet? 

48  Top  management  will  have  to  decide  on  that  question. 

49  The  system  crash  destroyed  most  of  our  records. 

50  Were  the  records  archived,  and  could  we  retrieve  them? 

51  You'd  better  consult  a  lawyer  about  our  liability. 

52  Print  up  a  financial  summary  sheet. 

53  Can't  we  avoid  an  overrun  on  the  rest  of  the  tasks? 

54  Print  the  last  four  messages. 

55  Is  there  any  mail  in  my  box? 

56  Show  me  message  seventy  two. 
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57  Delete  the  first  twenty  eight  messages. 

58  Send  message  fifty  seven  to  everyone  in  the  group. 

59  Clear  the  screen. 

60  Open  my  mail  file. 

61  Archive  all  deleted  messages. 

62  What  messages  have  I  had  from  ISI  this  year? 

63  Print  a  draft  of  this  chapter. 

64  How  many  pages  still  need  editing? 

65  Who  at  BBN  is  working  on  VLSI? 

66  Where  and  when  is  the  ARPA  review  meeting? 

67  Close  this  file. 

68  A  hundred  and  thirty  one  pages  is  too  long. 

69  Tell  Dwayne  we  don't  anticipate  a  cost  overrun. 

70  Who's  on  vacation  now? 

71  Who  deleted  the  backup  files? 

72  There's  room  for  five  more  lines  in  the  footnote. 

73  Is  there  a  concept  called  "Mailing  Address,"  or  something  close? 

74  What  concepts  have  roles  with  role-names  like  "Nickname?" 

75  Show  me  the  generic  concept  called  "Employee." 

76  I  can't  fit  a  new  label  below  it. 

77  Could  you  move  it  up? 

78  Now  make  an  individual  employee  concept. 

79  The  first  name  is  "Gordon,"  and  the  last  name  is  "Hook." 

80  His  Social  Security  number  is  2  5  4,  3  6,  7  9  zero  8. 

81  Is  there  a  role  on  employee  called  "Retirement  Fund?" 

82  How  about  a  role  called  "Pension  Program,"  or  "Pension  Plan?" 

83  I'd  like  to  see  the  structure  below  "Employee  Benefits." 

84  Empty  the  screen  and  save  the  current  display. 
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85  What’s  the  book  say  on  old  age  benefits? 

86  Is  it  a  dollar  amount? 

87  Set  the  value  of  old  age  benefits  to  twenty  five  thousand  dollars. 

88  Can  you  display  just  the  role-set? 

89  O.K.,  use  the  old  role. 

90  No,  change  that. 

91  Display  the  role-set  and  the  old  role. 

92  Now  I  need  some  figures  on  overhead  and  IR&D. 

93  I'd  like  to  see  the  representation  of  "Companies." 

94  Thanks. 

95  Now  show  me  the  concept  for  "Persons." 

96  I  need  your  help  on  layout  decisions. 

97  Change  the  role-name  to  "Residence." 

98  Could  you  put  the  role-value  map  here. 

99  Redefine  this  concept  to  include  individuation. 

100  Insert  a  telephone  number  of  6  43,  6638. 

101  That  should  read  twelve  thousand  and  three,  not  eleven  thousand. 

102  Make  that  forty  nine  hundred. 

103  There  should  be  seventeen  entries. 

104  Twenty  K  sounds  about  right. 

105  Order  some  more  Form  fourteen  seventy  threes. 

106  Type  up  six  copies  of  the  abstract. 

107  A  block  is  two  fifty  six  words,  right? 

108  Change  this  entry  to  eight  thousand. 

109  Would  you  put  this  box  to  this  side. 

110  Interchange  these  two  tables. 
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Fortran  Subroutine 

Source:  DRAI:  l SPEECH .  1. 1  DRARY  J  FFTR .  FOR . 

Subprogram  Definition: 

SUBROUT I NE  FFTR ( N2P0V , NS . S , TR , T I) 

FAST  FOURIER  TRANSFORM  OF  REAL  VECTOR  (RADIX  8-4-2) 
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Source:  DRA 1 :  (SPEECH. LIBRARY  J  I NTONE . FOR . 

Subprogram  Definition: 

integer  function  INTONEt  iher  t7.) 

I'orlran  integer  function  to  convert  from  hertz,  to  tones. 
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Subprogram  Definition: 

cltnrnr  ter’  <’  )  function  ItSTH  IN(»(promp  l_s  tr  i  ng  ) 


